Declarative sentences

Length | Sentence |
---|---|
15 | ダイワ薬品株式会社の株式86. |
15 | 長崎ウエスレヤン大学 ( ). |
15 | ファイル:Id er1-54. |
15 | 土井 1977,pp.4-5. |
15 | 画像:F-14 Tomcat. |
16 | 栃木県総合教育センター ( ). |
16 | ・[https://sites. |
16 | ファイル:MUTCD R3-3. |
16 | ファイル:JNR Kiha80. |
17 | CNN 2009年10月12日付. |

Exclamatory sentences

Length | Sentence |
---|---|
14 | 「DJ FUMI☆YEAR! |
15 | ; 「Move on now! |
16 | の元メンバーでCANDY GO! |
16 | 東京国際アニメフェア2008超! |
17 | BS-TBSでは『夏SUNSUN! |
18 | 】 共感のうた(動画元:Yahoo! |
21 | よにんでSUPER☆TEUCHI☆MIX! |

Interrogative sentences

Length | Sentence |
---|---|
15 | 黒の16手目で16.… de? |

The three tables above list the shortest sentences in the corpus: one table each for declarative, exclamatory and interrogative sentences. Length is measured in characters.
These sentences give some insight into the language and the corpus. Where they are malformed, they also hint at how preprocessing could be improved. Several entries above, such as "ファイル:Id er1-54." and "・[https://sites.", are fragments cut off at a period inside a file name or URL rather than complete sentences.
Only sentences accepted by the preprocessing appear here. Language detection usually requires a minimum number of known words, so some very short sentences may be missing from the corpus.
The exclamatory table, for example, corresponds to a query of this form:
SELECT CHAR_LENGTH(sentence) AS le, sentence FROM sentences WHERE sentence LIKE '%!' AND LENGTH(sentence) < 40 ORDER BY le LIMIT 15;
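
The selection logic of the query can also be sketched outside the database. The following is a minimal Python sketch, not the tooling actually used for this report. It assumes MySQL-style semantics, in which `length` counts bytes and `char_length` counts characters; the function name `shortest_sentences`, its parameters and the `sample` list are hypothetical and chosen only for illustration.

```python
def shortest_sentences(sentences, ending, max_bytes=40, limit=15):
    """Return (character_length, sentence) pairs for the shortest
    sentences that end with `ending`.

    Assumption: mirrors a query that filters on byte length
    (`length(sentence) < max_bytes`) but sorts by character length
    (`char_length(sentence)`).
    """
    hits = [
        (len(s), s)                      # len() counts characters
        for s in sentences
        if s.endswith(ending)
        and len(s.encode("utf-8")) < max_bytes   # byte-length filter
    ]
    # sorted() is stable, so equal-length sentences keep their input order.
    return sorted(hits, key=lambda pair: pair[0])[:limit]


# A few sentences taken from the tables above.
sample = [
    "ダイワ薬品株式会社の株式86.",
    "「DJ FUMI☆YEAR!",
    "; 「Move on now!",
    "黒の16手目で16.… de?",
]

for ending in (".", "!", "?"):
    print(ending, shortest_sentences(sample, ending))
```

Under this assumption the byte-length filter is considerably stricter for Japanese text than for ASCII text, because most Japanese characters occupy three bytes in UTF-8. That would be one possible explanation for why the exclamatory and interrogative tables contain fewer rows than the query's limit of 15.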
4.1.2 Sentences of fixed length I
4.1.3 Sentences of fixed length II
4.1.4 Sentences of fixed length III
4.1.5 Longest sentences